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(54) Method for generating anti-aliased text and line graphics in compressed document images 



(57) A method and system for storing and generat- 
ing anti-aliased text and lineart data from compressed 
document image files, using a MRC model that repre- 
sents the image as an ordered set of mask/image pairs 



at resolutions appropriate to the content of each layer. 
The method and system provide the ability to generate 
for anti-aliased text data to improve appearance at both 
high and low resolution, and to avoid baseline jitter of 
compressed tokens. 
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Description 

[0001] This invention relates to a method and system 
for storing and generating anti-aliased text or lineart 
from compressed document image files. More specifi- 5 
cally, the invention uses a Mixed Raster Content (MRC) 
model that represents the image as an ordered set or 
mask/image pairs at resolutions appropriate to the con- 
tent of each layer. When using token compression, use 
of anti-aliased text or lineart improves text and lineart 10 
image appearance for both low and high resolution by 
smoothing edges and avoiding token baseline jitter. 
[0002] Uncompressed grayscale or color scanned 
document images contain too much data for convenient 
on-line storage and retrieval. Lossless compression of 15 
a 300 ppi grayscale scanned image, using universal 
compression such as Lempel-Ziv, typically causes only 
a small reduction in stored data. Thus, for example, an 
8 MB uncompressed image may be minimally shrunk to 
4 to 7 MB after lossless compression. Compression is 20 
only minimal because most of the image data is pro- 
duced as a result of scanner noise in the 3 or 4 least 
significant bits. Thus, some lossy compression is nec- 
essary. However, due to conflicting application require- 
ments, there is no universal method that will fit all situ- 25 
ations. 

[0003] For example, suppose the requirement is that 
compression must be visually lossless. The amount of 
achievable compression is limited, and depends strong- 
ly on the scanning resolution. For example, at 300 ppi, 30 
simple hierarchical vector quantization (HVQ) provides 
a guaranteed 4x compression, with perhaps a 7x typical 
compression after further Lempel-Ziv coding. However, 
even at 8x compression, grayscale images are pro- 
duced that compress to about 1 .0 MB/page, which is too 35 
much for many applications. 

[0004] To get a reasonable (but not lossless) image, 
at significantly better compression, a MRC approach 
may be used, in which the image is stored as ordered 
pairs of (mask, image) layers. Mixed Raster Content *o 
(MRC) is one approach to satisfy the compression 
needs of differing types of data. MRC involves separat- 
ing a composite image into a plurality of masks, and sep- 
arately applying an appropriate compression technique 
to each image mask. The document is represented by 45 
a pixel map that is decomposed into a multiple mask 
representation. 

[0005] The masks allow the image to be painted 
through, and the ordering is necessary because the last 
pixel painted in each location is the one that is apparent so 
to a viewer. In the most simple but non-trivial example, 
two (mask, image) pairs are used. The first layer is the 
background image, represented as a low-resolution 
gray or color image, and its mask is taken to cover the 
entire image. The second layer is the text/lineart layer, 55 
represented by a binary high-resolution, e.g., a 300 ppi 
or greater mask and a very low resolution foreground 
color image that is painted through the high resolution 



mask. The foreground color image can be at even lower 
resolution than 100 ppi. 

[0006] It is possible to conform with the MRC format, 
and use a 300 ppi text or lineart mask, compressed loss- 
ily using connected component tokens, and a 100 ppi 
background image compressed with JPEG or wavelets. 
It is also possible to use a third (mask, image) layer pair 
for higher resolution embedded color images that are 
located by a segmentor. This third image layer may also 
be compressed using JPEG or wavelets. A similar ap- 
proach has also been used where text or lineart is also 
compressed lossily using binary image tokens and 
wavelet compression is used on the background image. 
[0007] For these MRC formats, the text or lineart 
mask can also be compressed losslessly using Group4, 
Limpel-Ziv, or arithmetic coding. However, there are 
several problems associated with the current use of a 
binary text or lineart mask. First, regardless of the com- 
pression method used on the text or lineart mask, the 
text or lineart, when rendered, has stair-steps on nearly 
horizontal or vertical lines. The text or lineart image qual- 
ity suffers from severe aliasing when sub-sampled. The 
poor quality is also evident when viewed at a higher res- 
olution on a cathode ray tube (CRT), where the charac- 
ter boundaries display the noisiness of individual pixels. 
Another weakness of the conventional method is the 
baseline jitter problem when connected component to- 
kens are used. It is very difficult to avoid visible baseline 
jitter when tokens are substituted; because the vertical 
alignment of individual characters is susceptible to the 
noise on character boundaries introduced by threshold- 
ing the grayscale character image to binary. 
[0008] These image quality deficits, stemming from 
the binary character of the text or lineart, make the con- 
ventional MRC format unacceptable for applications re- 
quiring that the compression loss is not observable, 
such as for a bookscanner where a visually lossless ar- 
chival gray image must be saved. 
[0009] The exemplary embodiments of the present in- 
vention ameliorate or significantly resolve these prob- 
lems with conventional methods and systems of storing 
and generating image data in rendering apparatuses at 
a relatively small cost in additional storage. Such prob- 
lems can be rectified to some extent, and at some cost, 
by allowing use of a grayscale text or lineart mask. This 
can be done within the MRCformat in a number of ways. 
Alternatively, another (mask, image) pair may be added 
for the boundary pixels, i.e., a pixel that has at least one 
of its eight nearest neighbors being of a color opposite 
to the pixel's color, of the text or lineart mask. Addition- 
ally, methods to renderthese boundary pixels, e.g., with- 
out using an explicit layer pair, may be implemented. 
Such solutions can be made with only a relatively small 
incremental cost in storage. 

[0010] The present invention provides a system and 
method for storing and generating anti-aliased text or 
lineart from document image files to improve the ap- 
pearance of rendered text or lineart at both low and high 
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resolutions. Each exemplary embodiment of the inven- 
tion achieves the goals with minimal cost in additional 
storage and rendering complexity. 
[0011] The exemplary embodiments of the invention 
generate an anti-aliased grayscale version of a text or s 
lineart mask used within an ordered set of mask/image 
pairs. The grayscale data can be stored with either loss- 
less compression or with lossy compression. 
[001 2] The exemplary embodiments of the present in- 
vention may also be applied to methods for anti-aliasing 
compressed representations of color images. To inter- 
polate between the low resolution foreground and back- 
ground color images, three values, e.g., for red, blue and 
green color separations, must be available for each 
boundary pixel. 

[0013] All exemplary embodiments involve either 
generating or saving pixel values for pixels near the 
boundaries of the text or lineart mask, and using these 
values in the rendering process to smooth the visual ap- 
pearance of the boundaries. If the data is compressed, 
the compressed representation for the pixels associated 
with the text or lineart mask and its boundary can be 
either saved as a set of full page images, or as a set of 
tokens that typically represent connected components 
in the foreground image. 

[0014] The exemplary embodiments of the present in- 
vention separately provide improved display and print 
rendering of MRC-compressed images. 
[001 5] The exemplary embodiments of the present in- 
vention separately provide means for providing anti-ali- 
ased text or lineart data while minimizing costs of addi- 
tional storage and rendering complexity. 
[001 6] It is preferred that the exemplary embodiments 
of the invention are implemented in conjunction with the 
processing of images from documents that contain dif- 
ferent types of data, e.g., MRC. 

[0017] The first exemplary embodiment uses high 
resolution grayscale scans. The second through fourth 
embodiments may be implemented with a combination 
of a high resolution binary scan and a low resolution 
grayscale scan. The fifth embodiment uses very high 
resolution binary scans. 

[0018] The first through fourth embodiments provide 
methods that use an explicit or implicit extra M RC mask/ 
image pair for the boundary pixels of the text or lineart 
mask. Methods according to the fifth embodiment gen- 
erate the gray boundary pixels at high resolution directly 
from a very high resolution binary representation of the 
text or lineart mask. 

[0019] The second andthird exemplary embodiments 
classify the boundary pixels of each type (ON and OFF) 
depending on their neighboring pixels in the text or lin- 
eart mask image, and substitute a specific pixel value 
for each type globally. Additionally, second exemplary 
embodiment uses an adaptive algorithm that computes 
the best pixel values from the image; the other uses pre- 
computed pixel values for similar images. Additional 
storage for this method is insignificant or nil. 



4 

[0020] The third and fourth embodiments provide 
methods that can be used with current scanners, e.g., 
high resolution is 300 ppi binary, and current MRC tor- 
mats, without any additional storage requirements. 
[0021] The fourth method does not use information 
about the neighborhood of the boundary pixel (in the 
mask), but instead just substitutes a single global value 
for boundary pixels of each type. 

[0022] The fifth exemplary embodiment substitutes 
the original pixels directly into the boundary. The quality 
and compressibility are controlled by the quantization 
applied to the stored pixels. The fifth method requires a 
very high resolution binary scan to generate the text or 
lineart mask at 600 ppi or greater. It then generates the 
anti-aliasing boundary values at 300 ppi by filtering and 
down-sampling. One of the simplest methods for doing 
so is the scale-to-gray conversion. 
[0023] These and other objects of the present inven- 
tion will be readily appreciated and understood from 
consideration of the following detailed description of ex- 
emplary embodiments of this invention , when taken with 
the accompanying drawings, in which: 

Figure 1 shows a high resolution original scanned 
image; 

Figure 2 shows the standard MRC version of the 
image shown in Figure 1 ; 

Figure 3 shows an original scanned image after 4x 
grayscale sub-sampling; 

Figure 4 shows a cleaned M RC version of the image 
shown in Figure 3 with binary mask after 4x sub- 
sampling; 

Figure 5 shows a schematic illustration of an image 
rendering system that implements each of the first 
through fifth exemplary embodiments; 
Figure 6 shows a flowchart outlining a method of 
image rendering in accordance with the first exem- 
plary embodiment; 

Figure 7 shows an image rendered using a binary 
mask; 

Figures 8-11 show a set of images, corresponding 
to Figure 7, rendered using selection of original pix- 
els on the boundary, quantized to two, three, four 
and frve bits in accordance with the first exemplary 
embodiment; 

Figure 12 shows an original image, as scanned, us- 
ing all eight bits, and corresponding to Figures 8-1 2; 
Figures 13-15 show rendering results where the 
boundary pixels are quantized to two, three and four 
most significant bits, respectively; 
Figure 16 shows a flowchart outlining a method of 
image rendering in accordance with the second ex- 
emplary embodiment; 

Figure 17 shows a chart of histograms of interior 
boundary pixels for, 0, 1 , 2 and 3 four-neighbors of 
opposite (OFF) color; 

Figure 18 shows a chart of histograms of exterior 
boundary pixels for, 0,1,2 and 3 four-neighbors of 
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opposite (ON) color; 
Figure 19 shows a chart of the median pixel values 
found from the unsmoothed histograms for the in- 
terior boundary pixels; 

Figure 20 shows a chart of the median pixel values 5 
found from the unsmoothed histograms for the ex- 
terior boundary pixels; 

Figure 21 shows a magnified text fragment that is 
produced by substituting the mean values shown in 
Figures 1 9 and 20 globally for the boundary pixels io 
using 4-connected median adapted pixel values in 
accordance with the second exemplary embodi- 
ment; 

Figure 22 shows the result of the 4x sub-sampled 
image corresponding to Figure 21 ; 15 
Figure 23 shows the magnified text fragment that is 
produced by substituting the mean values shown in 
Figures 19 and 20 globally for the boundary pixels 
using 8-connected median adapted pixel values in 
accordance with the second exemplary embodi- 20 
ment; 

Figure 24 shows the result of the 4x sub-sampled 
image corresponding to Figure 23; 
Figure 25 shows a flowchart outlining a method of 
image rendering in accordance with the third exem- 25 
plary embodiment; 

Figure 26 shows the magnified text fragment that is 

produced with a grayscale mask boundary using a 

non-adapted linearly interpolated approximation to 

4-connected median pixel values in accordance 30 

with the third exemplary embodiment; 

Figure 27 shows the result of the 4x sub-sampled 

image corresponding Figure 26; 

Figure 28 shows a flowchart outlining a method of 

image rendering in accordance with the fourth ex- 35 

emplary embodiment; 

Figure 29 shows the magnified text fragment that is 
produced with all pixels of the interior boundary set 
to a single threshold-dependent value, and ail pixels 
of the exterior boundary set to another threshold- *o 
dependent value in accordance with the fourth ex- 
emplary embodiment; 

Figure 30 shows the result of the 4x sub-sampled 
image corresponding Figure 29; 

Figure 31 shows a flowchart outlining a method of *s 
image rendering in accordance with the fifth exem- 
plary embodiment; 

Figure 32 shows the magnified text fragment that is 
produced with all pixels of the interior boundary set 
to a single threshold-dependent value, and all pixels so 
of the exterior boundary set to another threshold- 
dependent value in accordance with the fourth ex- 
emplary embodiment; and, 

Figure 33 shows the result of the 4x sub-sampled 
image corresponding Figure 29. 55 

[0024] Each of the exemplary embodiments of the in- 
vention assigns gray values to the boundary pixels of 



6 

the text or lineart mask. Each of these exemplary em- 
bodiments provides methods for grayscale anti-aliasing 
that are extended to color images, for text or lineart 
masks that are compressed either as images or as to- 
kens. It is foreseeable that color rendering of anti-ali- 
ased tokens will have a significant affect on scanning 
devices. 

[0025] Presently, scanning systems typically perform 
a 300 ppi binary scan for high resolution foreground 
components, and a 100 ppi color scan for foreground 
and background color images. The present invention 
provides the capability of using a grayscale value for the 
boundary pixel to render, by interpolation, color pixels 
when the text or lineart mask is compressed either as a 
full image or in tokens, and using any of the five exem- 
plary embodiments of the invention. 
[0026] The present invention is of use in any situation 
where it is desirable or necessary to render the text and 
lineart at greater than 300 binary ppi in an image. This 
is of particular use when printing, to eliminate obvious 
stair-steps in lines that are nearly horizontal or vertical, 
when viewing a magnified image, e.g., a 300 ppi image 
on an 80 ppi CRT, or when viewing a highly sub-sampled 
image, as when attempting to image an entire page scan 
in a window on a CRT display. When lossy token com- 
pression is used for the text or lineart mask, implemen- 
tation of the exemplary embodiments also improves the 
appearance of printed and displayed text or lineart by 
reducing the baseline jitter that is difficult to handle sat- 
isfactorily when using binary tokens at 300 - 400 ppi. 
. [0027] It is now recognized that a MRC image archi- 
tecture is flexible enough to provide high image quality 
with excellent compression, and standardization efforts 
are under way. The present invention fits well into the 
MRC architecture. As explained below, some of the ex- 
emplary embodiments utilize an additional image layer, 
others rely on the use of higher resolution binary masks; 
others provide an improvement in rendering from cur- 
rent MRC compressed images, altering either the spe- 
cific set of MRC layers or the data that is stored in them. 
[0028] Exemplary embodiments of the invention may 
be used in conjunction with a 300 ppi grayscale scan 
and a 1 00 ppi color scan. The high resolution scan, i.e., 
300 ppi, is performed in grayscale because this permits 
the use of adaptive thresholding to generate the bina- 
rized text or lineart mask. If instead a binary high reso- 
lution scan is used, where the scanner has performed 
a global thresholding, the text or lineart mask can be 
flawed in regions where the background is incorrectly 
thresholded. Thus, it is preferred that a file format has 
the high resolution grayscale values for boundary pixels 
available. 

[0029] It is preferred that the file format for storing the 
foreground mask as tokens is extensible. In addition to 
a token index and location, which are currently saved, 
the color, character and font ID, and anti-aliasing infor- 
mation can also be stored with each token instance. 
[0030] The importance of anti-aliasing is most evident 
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for document images viewed on a display. Without anti- 
aliasing, at very high resolution such as the full resolu- 
tion of the original binary scan, edge pixellation effects 
are quite noticeable. And at low resolution, e.g., 3-4x 
sub-sampling, the aliasing due to sub-sampling of a bi- 5 
nary mask severely compromises legibility of small text. 
[0031] Therefore, the present invention is directed at 
solving problems of aliasing, noisy characters, and 
baseline token jitter described above. Accordingly, the 
system and method allow images to be displayed at both 
high and low (sub-sampled) resolution, without the de- 
fects evident from thresholding. The exemplary embod- 
iments of the invention provide a variety of approaches 
for achieving this capability in the context of a layered 
MRC format. The exemplary embodiments provide im- 
proved visual quality, achievable compression, ease of 
implementation, and speed of decompression. There- 
fore, the exemplary embodiments achieve improved 
rendering. 

[0032] In the detailed explanation of the exemplary 
embodiments, high resolution data means a 300 - 400 
ppi binary or grayscale image. Very high resolution bi- 
nary data refers to a binary image at 600 ppi or greater. 
[0033] Images can be transformed between high res- 
olution gray and very high resolution binary, and vice- 
versa. In the up-resolution direction, i.e., high gray res- 
olution to very high binary resolution, an interpolate 
gray-to-gray expansion is followed by thresholding to bi- 
nary. In the down -resolution direction, i.e., very high bi- 
nary resolution to high gray resolution, a scale-to-gray 
conversion is used, which is typically a low-pass filter 
followed by sub-sampling. 

[0034] To implement some of the exemplary embodi- 
ments, e.g., the first and fifth embodiments, it is neces- 
sary to acquire either a very high resolution binary 
scanned image (fifth embodiment) or a high resolution 
grayscale scanned image (first embodiment) with at 
least 2 bits per pixel (bpp) and preferably 4 bpp. Be- 
cause these images can be interconverted, each of the 
first and fifth embodiments requires either a high reso- 
lution gray or very high resolution binary image because 
the method and system can acquire the necessary data 
from either type of image data. For example, if a method 
requires a very high resolution binary image, it can di- 
rectly obtain the image data or obtain it from a scanned 
high resolution gray image. 

[0035] However, the second through fourth embodi- 
ments do not require anything beyond standard high 
resolution binary scanned image data. 
[0036] All embodiments generate or save pixel values 
for pixels near the boundaries of the text or lineart mask, 
and use these values in the rendering process to smooth 
the visual appearance of the boundaries. There is no 
need to explicitly store a mask that identifies the bound- 
ary pixels of the text or lineart mask, because these are 
easily derived from the text or lineart mask itself. 
[0037] A compressed representation for pixels asso- 
ciated with a text or lineart mask and its boundary can 



either be saved as a set of full page images, or as a set 
of tokens that typically represent connected compo- 
nents in the foreground image. Although specific factors 
affect anti-aliasing a token representation of a text or 
lineart mask, the method of compressing the text or lin- 
eart mask does not conceptually affect the image qual- 
ity. 

[0038] During description of the five exemplary em- 
bodiments, various experiments were developed to pro- 
vide examples to exemplify the operation and resulting 
benefits of the embodiments. All examples relate to an 
image of text in 8 pt. font, that was scanned at 300 ppi 
and 8 bpp. High resolution display examples use a sin- 
gle word of the text. Low resolution examples use a larg- 
er block consisting of about 10 lines of text. High reso- 
lution display examples are magnified about 7x, relative 
to the size they would have if rendered at a scan reso- 
lution on a 300 ppi printer. At this resolution, the image 
data are about twice as big as they would appear at full 
(300 ppi) resolution on an 80 ppi monitor. The low res- 
olution images are derived by sub-sampling the full res- 
olution image by 4x, and then printing it at a size roughly 
comparable to both the size on the original scanned 
page and the direct display on an 80 ppi screen. Each 
pixel in the sub-sampled image is thus rendered as ap- 
proximately a 4x4 block of halftoned printed pixels. 
[0039] In the experiments implementing MRC, a glo- 
bally adapted threshold value (1 52) is used for extract- 
ing the binary text or lineart mask. The method sub-sam- 
ples pixels from the image at a specified rate, e.g., 8x, 
and forms a histogram of pixel values with overlapping 
bins. The lightest bin to have more than a given fraction 
of all the pixels, e.g. 15%, sets the background value. 
This background value is adjusted downward to deter- 
mine the global threshold to be used. 
[0040] Once the text or lineart mask has been identi- 
fied, the text or lineart mask boundary pixels are brought 
up to the general background level using a small (5x5) 
grayscale dilation. This grows the lighter background 
pixels through the boundary pixels, and it also lightens 
and smoothes the background. Superposing the text or 
lineart mask on this smoothed background gives the 
rendered MRC image, as it is currently done with a bi- 
nary text or lineart mask. 

[0041] Figure 1 shows the high resolution original 
scanned image and Figure 2 shows the standard MRC 
version, ff the original image was sub-sampled 4x and 
displayed on an 80 ppi monitor, the result is barely leg- 
ible. Figure 3 shows in print approximately what would 
be seen on a display screen. However, when the stand- 
ard MRC image with binarized mask is sub-sampled 4x, 
as shown in Figure 4, the results are not legible. 
[0042] In each of the exemplary embodiments except 
for the fifth embodiment the text or lineart mask is de- 
composed into two parts: interior and boundary. The in- 
terior part can be constructed from the binary text or lin- 
eart mask, by removing the ON boundary pixels, i.e., 
those ON pixels that are touching an OFF pixel. A mor- 
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phological erosion of the single mask using a 3x3 struc- 
turing element gives a simple implementation. However, 
the boundary part is composed of ail boundary pixels to 
the mask, both ON and OFF; namely, any pixel in the 
text or lineart mask that is adjacent, in an 8-connected 
sense, to a pixel of the opposite value. It is the union of 
(1 ) the pixels removed in constructing the interior part 
of the text or lineart mask and (2) the OFF pixels in the 
background that are adjacent to those boundary pixels. 
An implementation for construction of the boundary part 
performs the exclusive or operation, "XOR", of the ero- 
sion and dilation of the text or lineart mask, with both 
morphological operations using a 3x3 structuring ele- 
ment. 

[0043] The first through fourth exemplary embodi- 
ments set the interior pixels to black. However, each ex- 
emplary embodiment differs in how it handles the 
boundary pixels. 

[0044] Figure 5 illustrates a schematic diagram of an 
image rendering system 500 implementing the exem- 
plary embodiments of the present invention. Each of the 
exemplary embodiments may be implemented within 
the image rendering system 500. As shown in Figure 5, 
the image rendering system 500 includes a scanner 
51 0, an encoder 520, a decoder 530, memory 540. The 
scanner 51 0 is coupled to the encoder 520 and provides 
scanned image data to the encoder 520. The encoder 
520 is also coupled to the memory 540. The memory 
540 is also coupled to the decoder 530. The decoder 
may be incorporated in an image renderer, for example, 
a print engine with, e.g., a conventionally configured fly- 
ing spot raster output scanner, an image rendering de- 
vice for displaying an image on a display screen, etc. 
[0045] In accordance with a first exemplary embodi- 
ment, the scanner 51 0 scans an image to provide image 
data. The image data is provided to the encoder 520 
that encodes the image data. Such encoding may in- 
clude compression or other beneficial data manipula- 
tion. Encoding begins by identifying boundary pixels 
within the image data provided by the scanner 510. 
These boundary pixels, i.e., the actual scanned pixels 
on the boundary of the text or lineart mask, appropriately 
quantized, are saved in memory 540 by the encoder 520 
for use subsequently in rendering the image. 
[0046] In accordance with the first exemplary embod- 
iment of the invention, anti-aliasing of the text or lineart 
data begins with a special scanning scheme, e.g. , either 
binary very high resolution or grayscale high resolution, 
used to produce high resolution grayscale data, e.g., 
300 - 400 ppi. Encoding is not performed in an adaptive 
manner. Therefore, no global pixel value assignments 
are computed from the scanned image. There is no con- 
nectivity utilized. Therefore, the values of connected 
neighbors of a boundary mask pixel are not used in de- 
termining the pixel value assignments encoded data 
that is stored in memory 540. The individual derivation 
of grayscale boundary pixels is consistent with MRC in 
that the boundary pixels constitute a (mask, image) pair, 



where the mask is just the boundary mask described 
above, and the image consists of Os (or 1 s) everywhere 
except under the boundary mask, where the pixel value 
represents the actual value, quantized to some number 

5 of significant bits. The boundary pair resulting from en- 
coding according to the first exemplary embodiment is 
explicit. Therefore storage of an extra MRC boundary 
mask/image pair in memory 540 is required to subse- 
quently render the image. 

10 [0047] Figure 6 shows a flowchart indicating the flow 
of a method for providing anti-aliased text and lineart 
data in accordance with the first exemplary embodi- 
ment. The method begins in step S600 and control pro- 
ceeds to step S61 0. In step S61 0, a special scan is per- 

15 formed to generate the grayscale image data at high 
resolution. Control then proceeds to step S620. In step 
S620, the grayscale image data is binarized to produce 
high resolution binary data using a set threshold. Control 
then proceeds to step S630, in which the boundary pix- 

20 els of the binarized image data are identified and control 
proceeds to step S640. In step S640, the grayscale val- 
ues of the boundary pixels identified in step S630 are 
stored and control proceeds to step S650. In step S650, 
the high resolution binary pixel data and the stored gray- 

25 scale values of the boundary pixels are stored. Control 
then continues to step S660. In step S660, the high res- 
olution binary pixel data and the grayscale values of the 
boundary pixels values are used to render the image 
and control proceeds to step S670. In step S670, the 

30 method ends. 

[0048] During individual derivation of grayscale 
boundary pixels, the more bits retained, the more similar 
a subsequently rendered image will be to the original, 
scanned image, and the more expensive the storage. 

35 To illustrate, Figures 7-12 show the rendering results 
with some magnification, as in Figures 1 and 2. Figure 
7 shows an image rendered using a binary mask on 
cleaned background. Figures 8-11 show an image ren- 
dered in which the boundary pixels are displayed, quan- 
go tized to two : three, four and five most significant bits, 
respectively. Figure 12 shows the original scanned im- 
age. 

[0049] It is also instructive to see the images of text 
when treated this way and then subsampled by 4x. Fig- 

45 ures 13-15 show rendering results where the boundary 
pixels are quantized to two, three and four most signifi- 
cant bits, respectively. Comparing with the use of a bi- 
nary mask in Figure 7, it is evident that anti-aliasing is 
provided by rendering the boundary region in gray 

50 tones. Further, much of the anti-aliasing effect is provid- 
ed by the two most significant bits. When the image is 
rendered on a display screen, the difference amongst 
these three images is even more apparent, but even 
there, the use of three and four most significant bits give 

55 diminishing returns. 

[0050] In the data file holding the image data, only the 
original single foreground mask needs to be stored be- 
cause the interior and boundary masks are easily de- 
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rived from it, as described above. The boundary pixels, 
when quantized to two or three bits, are stored in a full 
boundary image, where all non-boundary pixels are giv- 
en the value zero (or max). This image should compress 
reasonably well using Lempel-Ziv. Alternatively, since 
the boundary pixel locations are known from the mask, 
it is not necessary to store the boundary pixels within a 
full image. Instead, the boundary pixels can be stored 
in a packed sequential array, in raster scan order, and 
their location in the image can be inferred from the fore- 
ground mask. 

[0051 ] Figure 1 6 shows a flowchart indicating the flow 
of a method for providing anti-aliased text and lineart 
data in accordance with the second exemplary embod- 
iment. The method begins in step S1600 and control 
proceeds to step S1610. In step S1610, the image is 
scanned specially to determine the actual grayscale val- 
ues of the boundary pixels at high resolution. Control 
then proceeds to step S1620. In step S1620, the gray- 
scale image data is binarized to produce high resolution 
binary data using a set threshold. Control then proceeds 
to step S1630. In step S1630, the boundary pixels are 
identified and control proceeds to step S1640. In step 
S1640, the boundary pixels are separated into interior 
and exterior pixels, which are boundary pixels that are 
ON and OFF, respectively, in the text or lineart mask. 
Control then proceeds to step S1 650, in which each sub- 
set, e.g., interior and exterior pixels, are analyzed sep- 
arately to determine connectedness to determine an ap- 
propriate gray value for each of the boundary pixels. 
Performance of step S1 650 is based on the assumption 
that the grayscale value of the original boundary pixel is 
correlated with the number of connected pixels in the 
mask image of opposite (or same) color. Subsequently, 
control proceeds to step S1660, gray scale values are 
calculated for the boundary pixels and control proceeds 
to step S1670. In step S1670, the high resolution binary 
boundary pixel values and the derived grayscale values 
of the grayscale pixel data are stored in memory as com- 
pressed data. Control then proceeds to step S1680 in 
which the high resolution binary boundary pixel data and 
the derived grayscale values of the boundary pixels are 
used to render the image and control proceeds to step 
S1690. In step S1690, the method ends. 
[0052] Two pixels are 4-connected if their positional 
relation is such that a second pixel is positioned adja- 
cent to a first pixel to the immediate left, right, top or 
bottom of the first pixel. Two pixels are B-connected if 
their positional relation is such that the second pixel is 
positioned adjacent to the first pixel to the immediate 
left, right, top or bottom or to the immediate upper-left, 
upper-right, lower-left or lower-right of the first pixel. As 
mentioned above, an assumption is made that the gray- 
scale value of the original boundary pixel is correlated 
with the number of connected pixels in the mask image 
of opposite (or same) color. For example, an interior 
(ON) pixel with three 4-connected OFF pixels is expect- 
ed to be lighter than one with only one 4-connected OFF 



pixels. Therefore, each of the boundary pixels of each 
type is analyzed in the image to form an estimate of its 
gray value from the connectivity. This estimate is formed 
by building a histogram of pixel values for each type and 

5 for each connectivity, and taking the median pixel value. 
[0053] Step S1640 may be performed using either 
type-4 or type-8 connectivity Experiments have been 
performed using both type-4 connectivity (analyzing the 
four orthogonal pixels surrounding a sampled pixel) and 

10 type-8 connectivity (analyzing the four orthogonal pixels 
as well as the four diagonal pixels surrounding the sam- 
pled pixel. The differences in results between type-4 and 
type-8 are not large. However, the type-4 connectivity 
evaluation results are discussed below in more detail 

15 because they are easier to visualize. 

[0054] To illustrate the validity of the assumption of 
pixel value correlation with connectivity, Figures 17 and 
18 show a plot of histograms for interior and exterior 
boundary pixels respectively. Figure 17 shows the his- 

20 tograms of interior boundary pixels, for 0,1,2 and 3 four- 
neighbors of opposite (OFF) color. Figure 18 shows the 
histograms of exterior boundary pixels, for 0,1,2 and 3 
four-neighbors of opposite (ON) color. 
[0055] Four curves can be distinguished in each of 

25 Figures 17 and 1 8. In Figure 1 7, the interior histogram, 
these curves are for 0, 1,2, and 3 connected pixels of 
opposite (OFF) color. There are few cases where four 
connected pixels are OFF, and during analysis these 
pixel values are set to the threshold value used for con- 

30 structing the text or lineart mask. In Figure 1 8, the exte- 
rior histogram, the four curves are again for 0, 1 , 2 and 
3 connected pixels of opposite (ON) color. There are few 
cases with pixels with four ON 4-neighbors. During anal- 
ysis, these pixel values are also set to threshold value 

35 used for constructing the text or lineart mask. In the data 
shown in Figures 17 and 18, the threshold value used 
for constructing the text or lineart mask was 152. As a 
result, it should be recognized that there are some pixels 
in each histogram that cross over the threshold that was 

40 used to generate the mask. This cross over is merely 
an artifact of the use of a smoothing filter of length 7 that 
was applied to the histograms. 

[0056] The median pixel values are found from the 
(unsmoothed) histograms, and are shown for interior 
45 and exterior boundary pixels in Figures 19 and 20, re- 
spectively. In each of these Figures, the median pixel 
value is plotted against the number of mask neighbors 
of opposite color. 

[0057] Figure 21 shows a magnified text fragment that 
50 is produced by substituting the mean values shown in 
Figures 19 and 20 globally for the boundary pixels. Fig- 
ure 21 shows the MRC image with grayscale mask 
boundary using 4-connected median adapted pixel val- 
ues. The magnified text fragment shown in Figure 21 is 
55 noisier than the corresponding fragment of the original 
image but storing the image data only requires storing 
the ten median values in addition to the standard MRC 
data. Figure 22 shows the result of the 4x sub-sampled 
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image. 

[0058] Figure 23 also shows the magnified text frag- 
ment that is produced by substituting the mean values 
shown in Figures 19 and 20 globally for the boundary 
pixels. However, Figure 23 shows the MRC image with 
grayscale mask boundary using 8-connected median 
adapted pixel values. Figure 24 shows the result of the 
4x sampled image. Comparison of Figure 21 and Figure 
23 indicates that there is little apparent difference in im- 
age quality between the 4 and 8 connected versions, 
when viewed on a display. 

[0059] In a third exemplary embodiment, non-adap- 
tive, connectivity-dependent global pixel values are 
generated. If high resolution grayscale values are not 
available, a set of values may be derived from analysis 
on similar images. A standard scanning scheme, e.g., 
binary high resolution, is used to produce high resolution 
binary data, e.g., 300 - 400 ppi. The method is not adap- 
tive. Therefore, global pixel value assignments are not 
computed from the scanned image. However, connec- 
tivity is utilized. Therefore, the values of connected 
neighbors of a boundary mask pixel are used in deter- 
mining the pixel value assignments. The resulting 
boundary pair is implicit, which means the extra mask/ 
image pair is constructed to render the image. 
[0060] Figure 25 shows a flowchart indicating the flow 
of a method for providing anti-aliased text and lineart 
data in accordance with the third exemplary embodi- 
ment. The method begins in step S2500 and control pro- 
ceeds to step S251 0. In step S251 0, a standard scan is 
performed to generate the binary image data at high res- 
olution. Control then proceeds to step S2520. In step 
S2520, boundary pixels are identified and control pro- 
ceeds to step S2530. In step S2530, an estimate of the 
median pixel values on the boundaries is performed and 
control proceeds to step S2540. In step S2530, the es- 
timate is performed based on the observation that, the 
median pixel value varies approximately linearly with the 
number of 4-neighbors of the opposite color. In step 
S2540, the boundary pixels are separated into interior 
and exterior pixels and control proceeds to step S2550. 
In step S2550, an approximate dependency on the 
number of oppositely colored 4-neighbors n is deter- 
mined from the estimated median pixel values and con- 
trol proceeds to step S2560. In step S2560, the gray- 
scale values of the boundary pixels and the binary high 
resolution data are stored in memory. Control then pro- 
ceeds to step S2570. In step S2570, the grayscale 
boundary pixel data and the binary high resolution data 
are used to render the image and control proceeds to 
step S2580. In step S2580, the method ends. 
[0061] The third exemplary embodiment is particular- 
ly effective when it is not possible to generate a high- 
resolution grayscale image, from which pixels can either 
be selected or adapted based on the statistics of the im- 
age. In such a case, it is still possible to perform an es- 
timate of the median pixel values on the boundaries, 
based on the observation that, when segregated by the 



number of 4-neighbors, the median pixel value varies 
approximately linearly with the number of 4-neighbors 
of opposite color. In the third embodiment, the boundary 
pixels are separated into interior and exterior pixels. 
s [0062] An approximate dependency of the boundary 
pixel gray value on the number of oppositely colored 
4-neighbors n can be determined from median value 
curves, such as those in Figures 1 9 and 20, by making 
a linear fit to the data end points (for 0 and 4 neighbors). 
io Therefore, for interior pixels, the pixel value for n = 0 is 
always assumed to be a fraction, for example, 0.45. of 
the threshold. Also, for n = 4, the pixel value is equal to 
the threshold. Hence, for all values of n, 

15 val in = 0.45T + (0.138T)n 

where T is the threshold. Likewise, for exterior pixels, 
the pixel value for n = 0 is always taken to be equal to 
20 T plus a fixed fraction, for example, 0.47, of the differ- 
ence between 255 and T, which reduces to 1 1 9 + 0.53T. 
Again for n = 4, the pixel value is T. Hence, the linear 
interpolation formula for all values of n is: 

25 va, out = 119 + 0.53T + (0. 1 1 7T - 30)n 

When these values are used in place of the (unknown) 
actual median values, the image quality is reasonably 

30 good. Using this method, the magnified fragment is 
shown in Figure 26 and the 4x subsampled image is 
shown in Figure 27. Figure 26 specifically shows a mag- 
nified MRC image with a grayscale mask boundary us- 
ing non-adapted linearly interpolated approximation to 

35 4-connected median pixel values. Figure 27 shows a 
MRC image with a grayscale mask boundary using non- 
adapted linearly interpolated approximation to 4-con- 
nected median pixel values after 4x sampling. 
[0063] In a fourth exemplary embodiment, non-adap- 

40 tive, connectivity-dependent global boundary pixels are 
assigned. This is performed by assigning interior bound- 
ary pixels a specified global value, and assigning exte- 
rior boundary pixels a specified global value. These val- 
ues can be determined based on the median values in 

45 the scanned image, or assigned based on analysis of 
similar images. A standard scanning scheme, e.g., bi- 
nary high resolution, is used to produce high resolution 
binary data, e.g., 300 - 400 ppi. The method is not adap- 
tive. Therefore, global pixel value assignments are not 

so computed from the scanned image. Furthermore, con- 
nectivity is not utilized. Therefore, the values of connect- 
ed neighbors of a boundary mask pixel are not used in 
determining the pixel value assignments. The resulting 
boundary pair is implicit, which means the extra mask/ 

55 image pair is constructed to render the image. 

[0064] Figure 28 shows a flowchart indicating the flow 
of a method for providing anti-aliased text and lineart 
data in accordance with the fourth exemplary embodi- 
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ment The method begins in step S2800 and control pro- 
ceeds to step S2810. In step S281 0, a standard scan is 
performed to generate the binary image data at high res- 
olution. Control then proceeds to step S2820. In step 
S2820, the boundary pixels are identified and control 
proceeds to step S2830. In step S2830, the boundary 
pixels are separated into interior and exterior boundary 
pixels and control proceeds to step S2840. In step 
S2840, a first global grayscale value is set f orthe interior 
boundary pixels and a second global grayscale value is 
set for the exterior boundary pixels. Control then pro- 
ceeds to step S2850. In step S2850, the first global gray- 
scale value is assigned to all interior boundary pixels 
and the second global grayscale value is assigned to all 
exterior boundary pixels. Control then proceeds to step 
S2860. In step S2860, the grayscale boundary pixel val- 
ues assigned in step S2850 and the binary high resolu- 
tion data provided in step S281 0 are stored in memory 
as compressed data. Control then proceeds to S2870, 
in which the grayscale boundary pixel values and the 
binary high resolution data are stored in memory. Sub- 
sequently, control proceeds to step S2880, in which the 
grayscale boundary pixel values and the binary high res- 
olution data are used to render the image and control 
proceeds to step S2890. In step S2890, the method 
ends. 

[0065] Implementation of the fourth exemplary em- 
bodiment is performed when high resolution grayscale 
data is unavailable and simplification of the method for 
performing anti-aliasing of the text or lineart mask is of 
heightened importance, for example, in a rendering en- 
gine that does not have time or computational ability to 
determine the number of oppositely colored adjacent 
pixels tor each boundary pixel It is worth noting that at 
some degree of simplification, rendering results are suf- 
ficiently poor so that it makes no sense to simplify fur- 
ther. In this exemplary embodiment, connectivity is ig- 
nored, a global value is assigned to all interior boundary 
pixels and another global value is assigned to all exterior 
boundary pixels. Therefore, only two parameters are 
chosen, and these global values can be taken as the 
median values of representative boundary pixels from 
other images in which the high resolution gray value is 
known. These values are found to be approximately va- 
1^ = 0 62T and val oul =77 + 0.70T. The former is deter- 
mined assuming the median value scales with threshold 
T as aT, and the latter is determined assuming the me- 
dian value scales with threshold as T + a(255 - T). These 
are the same forms as were used in the third exemplary 
embodiment for fitting the point n = 0. 
[0066] The results are illustrated in Figure 29 for the 
magnified fragment and in Figure 32 for the 4x subsam- 
pled image. For these images, the interior boundary pix- 
els are set to a threshold value of 95 arid the exterior 
ones are set to a threshold value of 1 83. The results are 
better than using a binary text or lineart mask, but worse 
than the other methods. The advantage here is that it is 
not necessary to compute the connectivity of each pixel 



in the boundary. 

[0067] In a fifth exemplary embodiment, a direct 
scale-to-gray is performed from a very high resolution 
(600 ppi or more) binary mask. A special scanning 

s scheme, e.g., either binary very high resolution or gray- 
scale-high resolution, is used to produce very high res- 
olution binary data, e.g., 600 ppi or greater. The method 
is adaptive, and local boundary pixel value assignments 
are computed from the scanned image. The values of 

10 connected neighbors of a boundary mask pixel are not 
used in determining the pixel value assignments. There- 
fore, there is no need to use an extra mask/image pair 
to render the image. 

[0068] The fifth exemplary embodiment of the inven- 
ts tion is the only exemplary embodiment that does not 
clearly fit within the MRC framework. Figure 31 shows 
a flowchart indicating the flow of a method for providing 
anti-aliased text and lineart data in accordance with the 
fifth exemplary embodiment The method begins in step 
20 S3100 and control proceeds to step S3110. In step 
S3110, very high resolution binary text or lineart mask 
data, of 600 ppi or greater, is obtained. The very high 
resolution binary text or lineart mask data can be ob- 
tained from a 300 ppi grayscale image, or scanned di- 
25 rectly and control proceeds to step S3120. In step 
S31 20, the text or lineart data is then filtered and control 
proceeds to step S31 30. In step S31 30, the filtered data 
is down-sampled to 300 ppi gray. Subsequently, control 
proceeds to step S3140, in which thresholding is per- 
30 formed on the high resolution grayscale pixel data to bi- 
narize the image data. Control then proceeds to step 
S3150, in which boundary pixels are identified using the 
binary image data produced in step S31 40. Control then 
proceeds to step S3 1 50, in which the high resolution bi- 
35 nary image data and the grayscale boundary pixel data 
is stored in memory as compressed data. Control then 
proceeds to step S3170, in which the compressed data 
is used to render the image and control proceeds to step 
S3180. In step S3180, the method ends. 
40 [0069] The first step -the reduction from very high 
resolution binary to high resolution gray- is often called 
M seal e-to-g ray", and is typically done by tiling the image 
preparatory to subsampling, and calculating a gray val- 
ue for each tile that is proportional to the number of ON 
45 pixels in the tile. In this case, the filtering operation is to 
take a convolution with a constant filter of size equal to 
the tile. A number of very fast conventional algorithms 
have been developed for this operation. Additionally, 
other methods of generating the 300 ppi gray data are 
50 useful. For example, the very high resolution binary im- 
age may be generated by performing a 2x grayscale ex- 
pansion on input 300 ppi grayscale image date to res- 
cale from 300 ppi to 600 ppi. Subsequently, the usual 
thresholding would be performed to generate the very 
55 high resolution binary mask which is stored in the file 
format, and on which the scale-to-gay operation is per- 
formed as in this exemplary embodiment. To perform 
anti-alias boundary pixel rendering from this high reso- 
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lution grayscale text or lineart mask, white also filtering 
and smoothing the background, the grayscale text or lin- 
eart mask is placed over the unfiltered original back- 
ground, as follows: (1) lowpass the grayscale text or lin- 
eart mask with a very high threshold value, e.g., 250, so s 
that all gray pixels associated with the mask are kept; 
(2) clean the background with the usual grayscale dila- 
tion; and (3) select the gray pixels from the grayscale 
mask that are under the lowpassed binary version of this 
mask. 10 
[0070] The magnified image fragment resulting from 
rendering in accordance with the fifth embodiment is 
shown in Figure 32. Because the grayscale pixels form 
a boundary that is only one pixel wide, instead of the 
usual two-pixel grayscale boundary from the original 15 
and the other methods, the boundary edges appear to 
be sharper. For example, a 4x sub-sampled image is 
shown in Figure 33. It appears sharperthan images anti- 
aliased by other means because of the narrow boundary 
region, and it is comparable legibility to the other meth- 20 
ods. In addition to the lack of fuzziness, the boundary 
pixels have a limited dynamic range, of five levels for 
the example shown where a 600 ppi binary mask is re- 
duced to a 300 ppi grayscale mask. The grayscale text 
or lineart mask is quantized to five levels, which requires 
little more than two bits of data, because a 2x sale-to- 
gray conversion is performed from a binary mask. 
[0071] In the MRC format, the binary mask would be 
stored in a file. However, storage for the 600 ppi binary 
mask in the file scales approximately linearly with the 
resolution. Therefore storage requires about double the 
storage space of the usual 300 ppi text or lineart mask. 
Thus, for rendering, the stored mask is scaled to gray 
at the high resolution (300 ppi), and all pixels that are 
not pure white are incorporated in the rendered image, 
as described above. 

[0072] As mentioned above, the exemplary embodi- 
ments of the invention may be implemented in combi- 
nation with lossy text or lineart mask compression by 
tokens. For example, the present invention may be im- 
plemented with lossy compression by connected com- 
ponent tokens. Conventionally, such tokens are binary 
images that are tiled to compose the text or lineart mask, 
i.e., the MRC foreground mask. Conventionally, these 
tokens, i.e., binary tokens, are generated from a bina- 
rized image, typically at 300 ppi. 

[0073] A binary token is usually taken, in the unsuper- 
vised classification scheme, to be the first token in- 
stance found of a new class of connected components. 
It is possible to form a composite grayscale token, by 
suitably aligning_and averaging the pixel values in all 
token instances of a particular class. However, this ap- 
proach has several major problems: alignment is tricky; 
it is computationally intensive and if there is only one 
token instance, the token remains binary. 
[0074] Use of grayscale tokens alleviates the token 
substitution baseline jitter problem because gray pixels 
around the boundary of the token reduce the visual ef- 



fect of hard pixel boundaries that can vary by ± 1 pixel 
in the standard token replacement algorithm. It also al- 
leviates the anti-aliasing problems on the text or lineart 
mask in general. Each grayscale token can be consid- 
ered to have three regions: foreground (ON pixels), 
boundary (gray pixels) and the remaining (OFF) pixels. 
The foreground and boundary layer pixels are non-in- 
tersecting. For implementing some of the exemplary 
embodiments of the invention, it may be necessary to 
decompose the boundary layer into two parts: an interior 
part, e.g.. ON pixels in the binary mask, and an exterior 
part, e.g., OFF pixels in the binary mask. In no case does 
the boundary need to be saved, because it is either de- 
fined implicitly as the set of boundary pixels in the text 
or lineart mask or it can be derived from the saved gray- 
scale pixels. It is worth noting, that the grayscale bound- 
ary pixels, when saved, achieve most of their effect from 
the 2 or 3 most significant bits. 

[0075] Typically, the binary tokens are substituted for 
the token instances using an alignment based on the 
bounding boxes of each. For example, the lower-left 
comer of the token is placed at the lower-left comer of 
the bounding box of the token instance. Special efforts 
can be made during compression to align the binary to- 
ken individually with each token instance in the image, 
based on pixel locations, e.g., by finding an alignment 
that minimizes the pixel-wise XOR operation or maxi- 
mizes the pixel-wise AND operation performed on the 
two images, or that aligns the center of masses to the 
nearest pixel. However, if special alignment techniques 
are not performed, the resulting baseline jitter will be no- 
ticeable. This is because the bounding box for the set 
of token instances can be displaced one pixel up or 
down (but not both) relative to the best visual alignment 
during the thresholding process. Then successive to- 
kens in a line of text or lineart can be vertically displaced 
from the best alignment by up to two pixels with respect 
to each other because one token can be displaced up- 
wards and the next downwards. 

[0076] A random set of relative displacements, with a 
maximum of two, is visible as baseline jitter. If the tokens 
are optimally aligned with each instance of the token, 
jitter is significantly reduced, but the maximum relative 
displacement is still about one pixel. When the tokens 
are rendered with grayscale anti-aliasing pixels on the 
boundary, the observable jitter is significantly reduced, 
whether or not each binary token has been optimally 
aligned with all of the token instances. It is foreseeable 
that the observable reduction in baseline jitter using an- 
ti-aliased tokens is about a factor of two, depending on 
the specific method of generating the anti-aliasing pix- 
els. Thus, the maximum relative baseline jitter will be 
about one pixel for unaligned tokens and about 0 .5 pixel 
for aligned tokens. The jitter in the former case may not 
be noticeable, and it will surely be unobservable in the 
latter. 

[0077] Each of the exemplary embodiments de- 
scribed above may be implemented in conjunction with 
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grayscale token compression. The changes from using 
full image masks are relatively small. Because the 
number of tokens will be limited, the extra storage re- 
quired for these grayscale boundary pixels must be less 
than that required if the full mask is stored rather than 5 
the tokens. 

[0078] In conjunction with the first exemplary embod- 
iment, the boundary pixels are selected from the original 
image, appropriately quantized and each token ana- 
lyzed as two separate layers: foreground and boundary. io 
These layers constitute two images that can be com- 
pressed separately The foreground image is binary and 
consists of the standard text or linear! mask, eroded to 
remove the interior boundary pixels. However, if the or- 
der of rendering of the layers is specified and the fore- 15 
ground is rendered first and the boundary second, it is 
not strictly necessary to erode the foreground text or lin- 
ear! mask because the interior boundary pixels will paint 
over them. 

[0079] As explained above in the detailed explanation 20 
of the first exemplary embodiment, the boundary pixels 
in the token can be stored as either a full image or as a 
packed array of pixels that are just the boundary pixels 
encountered in a raster scan order. When stored as a 
full image, the boundary image is grayscale , with the 25 
non-boundary pixels taking on a constant value, e.g., 
zero. When compressed by a universal source coder, e. 
g., Lempel-Ziv, few bits will be allocated to the non- 
boundary pixels. Nevertheless, the packed array of 
boundary pixels will always compress better than the full so 
image. 

[0080] In conjunction with the second exemplary em- 
bodiment, the analysis of grayscale pixel values based 
on the value in the high resolution grayscale image and 
the nearest neighbor colors in the thresholded mask is 35 
performed as explained above. The grayscale tokens 
are rendered by analyzing the connectivity of the bound- 
ary pixels and making the appropriate grayscale substi- 
tutions. There is no need to store an explicit layer (rep- 
resentation) of the boundary pixels. Again, only nine or *o 
ten numbers need to be stored when 4- or 8-neighbor 
connectivity is used, and these grayscale values are 
used in all the tokens. 

[0081] In conjunction with the third exemplary embod- ' 
iment, the grayscale tokens are as explained above, ex- 45 
cept that the substitution values are known a priori, and 
are not determined from the image being compressed. 
Rather, the substitution values are specified depending 
on whether the pixel is in the interior or exterior bound- 
ary, and on its 4- or 8-neighbor connectivity. The binary so 
text or lineart mask is saved, with no extra data, except 
perhaps for the threshold value that was used to gener- 
ate the text or lineart mask. It is not necessary to do a 
high resolution grayscale scan. 

[0082] In conjunction with the fourth exemplary em- ss 
bodiment, the boundary grayscale pixel values are as- 
signed as explained above based on constants for the 
interior and exterior boundary, independent of 4- or 



8-neighbor connectivity. As in the implementation of the 
third exemplary embodiment with grayscale token com- 
pression, only the threshold value needs to be saved in 
the file, but the appearance of these tokens is visually 
inferior because each boundary layer is given a single 
color. It is not necessary lo do a high resolution gray- 
scale scan. 

[0083] In conjunction with the fifth exemplary embod- 
iment, the tokens are stored at very high resolution (600 
ppi or greater), and the location at which the tokens are 
placed in the image is also stored at this resolution. 
When the image is to be rendered at lower resolution, 
a scale-to-gray transform is done separately on each to- 
ken. As described above in the description of the fifth 
embodiment, the gray anti-aliasing pixels form a bound- 
ary that is one (rather than two) pixels wide, and the edg- 
es thus appear sharper. The compressed tokens require 
approximately twice the storage for the image parts, but 
the location information for the tokens is not substantial- 
ly increased. Thus, the increase in storage when using 
tokens at 600 ppi instead of 300 ppi is much less than 
a factor of two, which was the increase predicted in the 
implementation of the fifth exemplary embodiment 
where the very high resolution binary text or lineart mask 
is compressed as a full image. 

[0084] The exemplary embodiments of the present in- 
vention may also be implemented to generate anti-ali- 
ased text and line graphics in color images. In a typical 
MRC format, both the foreground and background are 
represented by low-resolution color images. 
[0085] The first exemplary embodiment uses the ac- 
tual scanned values of the boundary pixels. This pro- 
vides the best results for the rendered boundary pixels, 
but from a storage view, it is the most storage-wasteful 
method. When tokens are used, only one set of bound- 
ary pixels is stored for each token. For grayscale imag- 
es, this is straightforward, as described above. Howev- 
er, for color, the fact that each token instance has a dif- 
ferent set of colors in both the foreground and back- 
ground and requires adaptation of the method for gen- 
erating a nti -aliased text data. 

[0086] A simple method for color anti-aliased tokens 
in accordance with the first exemplary embodiment can 
be used if a color scan at high resolution, e.g. : 300 ppi, 
is available. Therefore, implementation of the first ex- 
emplary embodiment may require that foreground and 
background images, which are typically scanned or 
saved at low resolution, have been scaled appropriately 
so that corresponding pixels for each location can be 
identified. For each high resolution color boundary pixel 
on a token, the three colors are stored as three gray val- 
ues. Each gray value is computed as a fraction of the 
gray, i.e., intensity between low-frequency foreground 
and background images at that location. For example, 
if a red intensity is halfway between foreground and 
background red intensities, the red gray value is taken 
to be 0.5. It also foreseeable that the gray value can be 
stored with quantized values between 0 and 2 - 1 , for 
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some value of n. These three gray values are thus used 
for interpolation between foreground and background 
pixels on the boundary pixels of each instance of the 
token. 

[0087] Assume that the foreground and background 
pixel saturation values for red are S r fore and S r back> and 
similarly for blue and green. Also assume the boundary 
pixel saturation value for red is S r bdry . Then the fraction 
sr bdr/ ( sr fore ' sr back) is stored and the red component 
of this boundary pixel in each token instance is rendered 
using this fraction and the local values of S r fore and S r - 

back- 

[0088] However, it may not be practical to scan the 
image with high resolution color so that these three color 
values are not available separately for each boundary 
pixel. Therefore, assume instead that only a high reso- 
lution grayscale image is available. Each boundary pixel 
is a weighted average of the contributions from the 
three, unavailable color values. In such a situation, this 
grayscale value can be used to store a single value for 
each token pixel. Specifically, the weighting factors ap- 
propriate to the scanner are used to find the weighted 
average saturations of the nearest pixels in the fore- 
ground and background color images. These saturation 
values are then compared with the high resolution gray- 
scale value for the boundary pixel. Subsequently, for 
each boundary pixel on the token , an interpolation factor 
between foreground and background is stored. Howev- 
er, in this implementation, it is a single number. For ren- 
dering, this interpolation factor is selected for the bound- 
ary pixel of each token instance, and is used to find the 
red, green and blue intensities by interpolating between 
the foreground and background low-resolution color im- 
ages. 

[0089] The crucial element is that the boundary pixels 
of the token are labeled with parameter values that allow 
an approximate reconstruction of the actual color for 
each token instance, which has different foreground and 
background colors. A linear interpolation, as described 
above, is simple, but other weighted interpolation meth- 
ods could also be used. 

[0090] Although the application of the first exemplary 
embodiment to color images has been presented, one 
of ordinary skill in the art would readily recognize how 
the second through fifth embodiments may be imple- 
mented to provide color anti-aliased tokens. The basic 
idea when using tokens is to save just the gray value, 
or if appropriate, three gray values, and to use this in- 
formation to determine how similarthe rendered bound- 
ary pixel is to the foreground and background pixels in 
that vicinity. 

[0091] When using the full image mask instead of to- 
kens, the first exemplary embodiment can be imple- 
mented in two different ways. The straightforward meth- 
od is to save and use the color boundary pixels directly. 
However, if only the grayscale value is available, the 
grayscale data can be used to interpolate between fore- 
ground and background, as described above fortokens. 



When implementing the second through fifth embodi- 
ments in conjunction with the full image mask, interpo- 
lation must be performed using the computed gray val- 
ue, since the actual color or grayscale values of the high 

5 resolution boundary pixels are not available. 

[0092] The exemplary embodiments of the present in- 
vention provide methods and systems for adding anti- 
aliasing to MRC text or lineart masks. These methods 
improve the appearance of a resulting MRC image, both 

10 on display, at high resolution and under typical viewing 
conditions which are sub-sampled, and on printed out- 
put Although the experimental result examples shown 
in some of the Figures indicate potential display output 
for the various exemplary embodiments, the ability to 

15 capture on paper the quality of the visual image as it 
would appear on a 80 ppi display is limited. For example, 
a binary 300 ppi printer was used to render the images 
and the rendering algorithm PostScript™ applied half- 
tone filters to render the gray pixels. Each gray pixel on 

20 the display is rendered approximately by a 4x4 halftoned 
region. 

[0093] Additionally, although methods of image ren- 
dering have been explained above to indicate that com- 
pressed data is stored to memory 540 and subsequently 

25 decoded and used to render images, it is not essential 
that the compressed data be stored in memory 540. 
Rather, it is foreseeable that the compressed data pro- 
vided by the encoder 530 may be provided directly to 
the decoder 530 to reduce the amount of data to be 

30 transmitted to the decoder, e.g., when the decoder is 
actually part of an image renderer. Such an application 
would be beneficial when data space in the connection 
to the image render is limited. 

35 

Claims 

1. An image rendering system comprising: 

40 a scanner that scans an image and produces 

image data; 

an encoder that is coupled to the scanner and 
encodes the image data to provide encoded im- 
age data including anti-aliased grayscale text 
45 or lineart data that includes an identification of 

boundary pixels and associated pixel values; 
and 

a decoder that is coupled to the encoder and 
decodes the encoded image data to provide de- 
50 compressed data including anti-aliased text or 

lineart data and renders the decompressed da- 
ta. 

2. The image rendering system of claim 1 , wherein the 
55 system implements an MRC image architecture. 

3. The image rendering system of claim 1 or claim 2, 
wherein the scanner scans an image and produces 
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high resolution grayscale data, and wherein the en- 
coder separates the grayscale pixels of the high 
resolution grayscale data into boundary pixels and 
non-boundary pixels, individually derives values of 
grayscale boundary pixels using the high resolution s 
grayscale data, and stores the individually derived 
values of the grayscale boundary pixels. 

4. The image rendering system of claim 3, wherein the 
encoder compresses the individually derived val- io 
ues of the grayscale boundary pixels as part of a 

set of grayscale tokens that represent connected 
components in a foreground image that is part of 
the scanned image data. 

15 

5. The image rendering system of claim 4, wherein the 
image data is color image data and a foreground 
image and a background image of the scanned im- 
age data are represented by low- resolution color 
data. 20 

6. The image rendering system of claim 4, wherein, 
during compression, the encoder selects the 
boundary pixels from the scanned image data, 
quantizes the boundary pixels and analyzes each 25 
token as both a foreground layer and a boundary 
layer. 

7. The image rendering system of claim 6, wherein the 
foreground layer and the boundary layer are com- 30 
pressed separately. 

8. The image rendering system of claim 4, wherein on- 
ly one set of boundary pixels Is stored for each to- 
ken. 35 

9. The image rendering system of claim 5, wherein, 
for each high resolution color boundary pixel on a 
token, three colors are stored as three grayscale 
values. 40 

10. An image rendering method comprising: 

scanning an image to obtain scanned image 
data including text or linear! data; *5 
generating an anti-aliased grayscale version of 
the text or lineart data inlcuding determining 
pixel values of the boundary pixels in the anti- 
aliased grayscale version of the scanned text 
or lineart data; 50 
rendering the image using the determined pixel 
values. 



55 
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